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r-{ \ Large graphs are sometimes studied through their degree sequen- 

ces (power law or regular graphs). We study graphs that are uniformly 
chosen with a given degree sequence. Under mild conditions, it is 
f^ , shown that sequences of such graphs have graph limits in the sense 

CO ' of Lovasz and Szegedy with identifiable limits. This allows simple 

determination of other features such as the number of triangles. The 
argument proceeds by studying a natural exponential model having 
the degree sequence as a sufficient statistic. The maximum likelihood 
estimate (MLE) of the parameters is shown to be unique and con- 
r^ . sistent with high probability. Thus n parameters can be consistently 



< 






c^ 






estimated based on a sample of size one. A fast, provably convergent, 
algorithm for the MLE is derived. These ingredients combine to prove 
the graph limit theorem. Along the way, a continuous version of the 
Erdos-Gallai characterization of degree sequences is derived. 



in 

\jQ • 1. Introduction. 

m ■ 

1.1. Graphs with a given degree sequence. Let G be an undirected sim- 
ple grapii on n vertices and let di , . . . , d„ be the degrees of the vertices 

Q . of G. The vector d := (di, . . . , d„) is usually called the degree sequence of G. 

(^ \ Correspondingly, the degree distribution of G is the probability distribution 

function F supported on [0, 1], defined as 



i7(2;):=K.'-'^*-"^}l 



n 

H : 

5^ , In other words, if a vertex is chosen uniformly at random, then the degree of 

that vertex, divided by n, is a random variable with probability distribution 
function F. 
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In recent years, the degree distributions of real world networks have re- 
ceived wide attention. The surveys [41, 42] contain many references as does 
the detailed account in [12] . The enthusiasm of some authors for "scale free" 
or "power law graphs" has also generated much controversy [33, 53] which 
serves as additional motivation for the present paper. 

The interest in degree distributions stems from the fact that the degree 
sequences of real world networks sometimes appear to have power law behav- 
ior that is very different than those occurring in classical models of random 
graphs, like the Erdos-Renyi model [21]. Researchers have tried various ways 
of circumventing this problem. An obvious solution is to build random graph 
models that are forced to give us the degree distribution that we want and 
then deduce other features by simulation or mathematics. A natural way to 
do this is to choose a graph uniformly at random from the set of all graphs 
with a given degree sequence. One frequent appearance of this model is for 
random regular graphs [54]. As explained in [12], Section 13, the model also 
arises in testing if the exponential family with degree sequence as sufficient 
statistic fits a given data set. See [51] for applications where the number 
of triangles is wanted. The paper [12] has useful ways of simulating graphs 
with a given degree sequence and an extensive survey of the (mostly non- 
rigorous) literature for this model. Some rigorous results are also available 
in the "sparse case," for example, those in [39, 40]. 

At this point, a gap between our motivation and our theory must be 
pointed out: the present paper deals with dense graphs with a given de- 
gree sequence (roughly, graphs whose number of edges is comparable to the 
square of the number of vertices), whereas much of the literature cited above, 
for example, power law graphs, revolves around sparse graphs. As of now, 
our theorems are not directly applicable in the sparse setting, although there 
is certainly hope for future progress. 

In a recent series of papers [5-10], Barvinok and Hartigan have looked 
at problems related to the structure of directed and undirected (dense) 
graphs with given degree sequence. The Barvinok and Hartigan work, espe- 
cially [10], is related to the present paper. This is explained at the end of 
this Introduction after we have stated our main theorems. 

One of the objectives of this article is to give a rather precise description of 
the structure of random (dense) graphs with a given degree sequence via the 
notion of graph limits introduced recently by Lovasz and Szegedy [34] and 
developed by Borgs et al. [13-15]. See also the related work of Diaconis and 
Janson [19] and Austin [2] which traces this back to work of Aldous [1] and 
Hoover [28]. This gives, in particular, a way to write down exact formulas 
for the expected number of subgraphs of a given type without simulation. 

Before stating our result, we need to introduce the notion of graph limits. 
We quote the definition verbatim from [34] (see also [14, 15, 19]). Let Gn 
be a sequence of simple graphs whose number of nodes tends to infinity. For 



RANDOM GRAPHS WITH A GIVEN DEGREE SEQUENCE 3 

every fixed simple grapli H , let | hom(if , G) \ denote the number of homomor- 
phisms of H into G [i.e., edge-preserving maps V{H) —^ V{G), where V{H) 
and V{G) are the vertex sets]. This number is normalized to get the homo- 
morphism density 

(1) ^^^^^>-\viG)\\ym- 

This gives the probability that a random mapping V{H) — )• V{G) is a ho- 
momorphism. 

Suppose that the graphs G„ become more and more similar in the sense 
that t{H, Gn) tends to a limit t{H) for every H . One way to define a limit 
of the sequence {Gn} is to define an appropriate limit object from which the 
values t{H) can be read off. 

The main result of [34] (following the earlier equivalent work of Aldous [1] 
and Hoover [28]) is that indeed there is a natural "limit object" in the form 
of a symmetric measurable function W : [0, 1]^ — >■ [0, 1] [we call W symmetric 
if W{x,y) = W{y,x)]. Conversely, every such function arises as the limit of 
an appropriate graph sequence. This limit object determines all the limits 
of subgraph densities: if i?^ is a simple graph with V{H) = [k] = {1, . . . ,k}, 
then 



t{H,w)= I rr w{xi,xj)dxi 



dxk- 

Here E{H) denotes the edge set of H. 

Intuitively, the interval [0, 1] represents a "continuum" of vertices and W{x, 
y) denotes the probability of putting an edge between x and y. For exam- 
ple, for the Erdos-Renyi graph Gn,p, if p is fixed and n — )■ oo, then the limit 
graph is represented by the function that is identically equal to p on [0, 1]^. 

Convergence of a sequence of graphs to a limit has many consequences. 
From the definition, the count of fixed size subgraphs converges to the right- 
hand side of the expression for t{H, W) given above. More global parameters 
also converge. For example, the degree distribution converges to the law of 
Jq W{U,y)dy where U is a random variable distributed uniformly on [0, 1]. 
Similarly, the distribution function of the eigenvalues of the adjacency matrix 
converges. More generally, a graph parameter is a function from the space of 
graphs into a space X which is invariant under isomorphisms. If A" is a to- 
pological space, we may ask which graph parameters are continuous with 
respect to the topology induced by graph limits. This is called "property tes- 
ting" in the computer science theory literature which has identified many 
continuous graph parameters. See the surveys [3, 14] for pointers to the lite- 
rature. 

We are now ready to state our result about the limit of graphs with given 
degree sequences. Suppose that for each n, a degree sequence d" = (d", . . . , d") 
is given. Without loss of generality, assume that di >d2 > ■ - ■ >dn- We say 
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that the sequence {d"} has a scaling limit if there is a nonincreasing func- 
tion / on [0, 1] such that 



(2) hm 



n— >oo \ 



n 



+ 



n 



1 " 

+ -E 



n \n 



0. 



It is not difficult to prove by a simple compactness argument that any 
sequence {d"} of degree sequences has a subsequence that converges to 
a scaling limit in the above sense. Note that convergence in the above sense 
can be stated equivalently in terms of convergence of degree distributions: 
(i"/n^'/(0), (i"/n^'/(l) and Dn/n ^- f{U) in distribution, where Dn is 
a randomly (uniformly) chosen d" and U is uniformly distributed on [0, 1]. 

The need to control di and d^ arises from the need to eliminate "outlier" 
vertices that connect to too many or too few nodes which takes the degree 
sequence too close to the Erdos-Gallai boundary (see below). Since d^ is 
decreasing in i, outliers can be eliminated by simply controlling d" and d". 
The need to eliminate outliers, on the other hand, arises from technical 
aspects of our analysis. 

Define L''[0, 1] to be the set of nonincreasing functions on [0,1] which 
are left continuous on (0,1). The reason for imposing left-continuity is the 
following: When the scaling limit of a degree sequence is discontinuous, it is 
not uniquely defined but there always exists a unique limit in L''[0, 1]. We 
could have as well chosen right-continuous. 

For each n, let Gn be a random graph chosen uniformly from the set of 
all simple graphs with degree sequence d*^. Let / be the scaling limit of the 
sequence {d'"} in the sense defined above. Our objective is to compute the 
limit of the sequence {Gn} in terms of the scaling limit of d". We endow the 
set of scaling limits (i.e., -D'[0, 1]) with the topology induced by a modified L^ 
norm || • ||i/ given by 

ll/lli' := 1/(0)1 + 1/(1)1 +r|/(x)|cix. 

Jo 

The choice of this norm is necessitated by the need to make it compatible 
with our previous notion of convergence of degree sequences. 

Not all functions can be scaling limits of degree sequences. Let J-" be the 
set of functions in D'[0, 1] that can be obtained as scaling limits of degree 
sequences in the sense stated above. By a simple diagonal argument, it is 
easy to see that J^ is a closed subset of D'[0, 1] under the topology of the 
modified L^ norm. It is shown in Proposition 1.2 that J- has nonempty 
interior. 

Theorem 1.1. Let Gn and f be as above. Suppose that f belongs to the 
topological interior of the set T defined above. Then there exists a unique 



RANDOM GRAPHS WITH A GIVEN DEGREE SEQUENCE 
function g:[0,l] — ^ M in D'[0, 1] such that the function 



W{x,y):-- 



l _)_ e9{x)+g{y) 

satisfies, for all x € [0, 1], 

f{x)= / W{x,y)dy. 
Jo 

In this situation, the sequence {G„} converges almost surely to the limit 
graph represented by the function W . 

Theorem 1.1 can be useful only if we can provide a simple way of checking 
whether / belongs to the interior of J^. (Being the limit of a sequence of 
degree sequences, it is clear that f £ J-. The nontrivial question is whether / 
is in the interior.) The following result gives an easily verifiable equivalent 
condition. 

Proposition 1.2. A function f: [0, 1] -^ [0, 1] in D'[0, 1] belongs to the 
interior of J- if and only if: 

(i) there are two constants ci > and C2 < 1 such that c\ < f{x) < C2 for 

all x G [0, 1] and 
(ii) for each x € (0, 1], 

fi rx 



pi px 

/ min{/(y),a;}<iy + x^- / f{y)dy>0. 
Jx Jo 



Remark 1. Condition (ii) in the above result is a continuum version of 
the well-known Erdos-Gallai criterion [22]: Suppose di > ^2 > • • • > dn are 
nonnegative integers. The Erdos-Gallai criterion says that di, . . . ,dn can be 
the degree sequence of a simple graph on n vertices if and only if ^"^^ di is 
even and for each 1 <k <n, 

k n 

/ di<k{k — l)+ y uim{di,k}. 

i=l i=k+l 

(See [35] for extensive discussions and eight equivalent conditions.) 

Remark 2. When the scaling limit / is continuous, convergence in the 
modified L^ norm is the same as supnorm convergence. In particular, for 
continuous scaling limits Theorem 1.1 and Proposition 1.2 both hold if we 
replace D'[0, 1] with C[0, 1] and redefine J^ analogously under the supnorm 
topology. 
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Remark 3. As an example, consider the limit of the Erdos-Renyi graph 
G{n,p) as n — )• oo. Here f{x) =p for all x. Condition (ii) becomes (1 — 
x) min{p, x} + x'^ — px > for all x. Considering the two cases x > p and 
X < p it is easy to see that this holds, so Erdos-Renyi graphs are in the 
interior of J-" for any fixed p, < p < 1. 

Remark 4. In a recent article [37] (following up on the older work [38]), 
McKay has computed subgraph counts in random graphs with a given degree 
sequence. However, McKay's results hold only if either the graph is sparse 
or the graph is dense but all degrees are within n^' ^^^ of the average degree. 
Thus, it may be possible to recover Theorem 1.1 from McKay's results when 
the limit shape is a constant function but not in other cases. 

The next natural question is whether one can feasibly compute the func- 
tion g in Theorem 1.1 for a given /. It turns out that this is a central issue 
in the whole analysis. In fact, to prove Theorem 1.1 we analyze a related 
statistical model; computation of the maximum likelihood estimate in that 
model leads to an algorithm for computing g which, in turn, yields a proof 
of Theorem 1.1. The statistical model is discussed next. 

1.2. Statistics with degree sequences. Informally, if the degree sequence 
captures the information in a graph, different graphs with the same degree 
sequence are judged equally likely. This can be formalized by saying that 
the degree sequence is a sufficient statistic for a probability distribution 
on graphs. The Koopman-Pitman-Darmois theorem forces this distribution 
to be of exponential form. This approach to model building is explained 
and developed in [32]. Diaconis and Preedman [17] give a version of the 
Koopman-Pitman-Darmois theorem for discrete exponential families. The 
approach is also standard fare in statistical mechanics where the uniform 
distribution on graphs with fixed degree sequence is called "micro-canonical" 
and the exponential distribution is called "canonical" (see [43]). It turns out 
that the exponential model has a simple description in terms of independent 
Bernoulli random variables. 

Given a vector /3 = (/3i, . . . , /3„) G M"-, let P^ be the law of the undirected 
random graph on n vertices defined as follows: for each 1 < ? 7^ j < n, put 
an edge between the vertices i and j with probability 

independently of all other edges. Thus, if G is a graph with degree sequence 
di,. . . ,dn, the probability of observing G under P^ is 
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Henceforth, this model of random graphs is called the "/3-model." This 
model was considered by Holland and Lienhardt [27] in the directed case 
and by Park and Newman [43] and Blitzstein and Diaconis [12] in the undi- 
rected case. It is a close cousin to the Bradley-Terry model for rankings 
[which itself goes back (at least) to Zermelo]. See [29] for extensive refer- 
ences. The /3-model is also a simple version of a host of exponential models 
actively in use for analyzing network data. We will not try to survey this 
vast literature but recommend the extensive treatments in [30, 41, 48]. The 
website for the International Network for Social Network Analysis contains 
further information. 

Suppose a random graph G is generated from the /3-model where (3 € M" 
is unknown. Is it possible to estimate (3 from the observed G? It is not 
difficult to show that the maximum likelihood estimate (MLE) /3 of /3 must 
satisfy the system of equations 



j^i 1 + e^^+P^ 



,n. 



where di,...,dn are the degrees of the vertices in the observed graph G. 
Questions may arise about the existence, uniqueness and accuracy of the 
MLE. Since the dimension of the parameter space grows with n, it is not 
clear if this is a "good" estimate of /3 in the traditional sense of consistency 
in statistical estimation theory. 

The following theorem shows that under certain mild assumptions on /3, 
there is a high chance that the MLE exists, is unique and estimates /3 with 
uniform accuracy in all coordinates. 

Theorem 1.3. Let G be drawn from the probability measure P^ and let 
di,...,dn be the degree sequence of G. Let L := maxi<j<„, |/3j|. Then there 
is a constant C{L) depending only on L such that with probability at least 
1 — C{L)n~'^ , there exists a unique solution j3 of the maximum likelihood 
equations (3), that satisfies 



im:K\h-Pi\<C{L)\r^^. 

l<i<n V n 

It may seem surprising that all n parameters can be accurately estimated 
from a single realization of the graph. However, one needs to observe that 
there are, in fact, n{n — l)/2 independent random variables lurking in the 
background (namely, the indicators whether edges are present or not). There 
is a well-known heuristic that in a p-parameter model with m observations, 
"the usual asymptotics" work provided that p^ /m tends to zero as m tends 
to infinity. See [44-47] for details (and counter examples). In our model p = n 
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and m = n{n — l)/2, so p^/m does not tend to zero but stays bounded. The 
heuristic, although not directly applicable, hints at a reason why one can 
expect estimability of parameters. 

In work closer to the present paper, Simons and Yao [50] studied the 
Bradley-Terry model for comparing n contestants. Here a random orienta- 
tion of the complete graph on n vertices is chosen based on "player a beats 
player b with probability e{a)/[9{a) + 6'(6)]." They show that MLE is con- 
sistent here as well. Hunter [29] shows that the MM algorithm also behaves 
well in this problem. 

The next theorem characterizes all possible expected degree sequences of 
the /3-model as /3 ranges over M". The nice feature is that no degree sequence 
is left out. 

Theorem 1.4. Let TZ denote the set of all expected degree sequences of 
random graphs following the law P/3 as (3 ranges over M" . Let D denote the 
set of all possible degree sequences of undirected graphs on n vertices. Then 

conv(P) = TZ, 

where con.v(T>) denotes the convex hull ofT> and IZ is the topological closure 
ofU. 

Incidentally, the convex hull of P is a well-studied polytope. For example, 
its extreme points are the threshold graphs. (A graph is a threshold graph 
if there is a real number S and for each vertex v a real vertex weight w{v) 
such that, for any two vertices f,n, {u,v), there is an edge if and only if 
w{u) + w{v) > S. See [35] for much more on this.) 

A self-contained proof of Theorem 1.4 is given in Section 3. However, 
it is possible to derive it from classical results about the mean space of 
exponential families (see, e.g., [16] or [4]; in particular, see [52], Theorem 3.3). 

Finally, let us describe a fast algorithm for computing the MLE if it exists. 
Recall that the L°° norm of a vector x = [xi, . . . , x„) is defined as 

^ 00 • ~~ niax x^ . 

l<i<n 

For l<i^j<n and x G M", let 

(4) r,j(x 



e ^^ + e^ 



Given a realization of the random graph G with degree sequence di,. . . ,dn, 
define for each i the function 

(5) 99j(x) := log di - log^rij(x). 
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Let (p:W^ — >■ M" be the function whose ith component is (pi. An easy re- 
arrangement of terms shows that the fixed points of (p are precisely the 
solutions of (3). The following theorem exploits this to give an algorithm for 
computing the MLE in the /3-model. 

Theorem 1.5. Suppose the ML equations (3) have a solution (3. Then (3 
is a fixed point of the function ip. Starting from any xq G M", define x^+i = 
(p{xk) for A; = 0, 1, 2, ... . Then x^ converges to (5 geometrically fast in the L°° 
norm where the rate depends only on (|/3|oo) |xo|oo)- -^^ particular, (3 must 
he the unique solution of (3). Moreover, 

|xo-^|oo <C'|xo-Xi|oo, 

where C is a continuous function of the pair (|/3|ooi |xo|oo)- Conversely, if 
the ML equations (3) do not have a solution, then the sequence {x^} must 
have a divergent subsequence. 

There are many other algorithms available for calculating the MLE. For 
example, Holland and Leinhardt [27] use an iterative scaling algorithm and 
discuss the method of scoring and weighted least squares. Hunter [29] de- 
velops the MM algorithm for a similar task. Markov chain Monte Carlo 
algorithms and the Robbins-Monro stochastic approximation approach are 
also used for computing the MLE in exponential random graph models. 
See [31], Section 6.5.2, for examples and literature. The iterative algorithm 
we use is a hybrid of standard algorithms which works well in practice and 
allows the strong conclusions of Theorem 1.5. We hope that variants can be 
developed for related high dimensional problems. 

Let us now look at the results of some simulations. The left-hand panel in 
Figure 1 shows the plot of /3j versus /3.j for a graph with 100 vertices, where 
Pi,...,/3n were chosen independently at uniform from the interval [—1,1]. 
The right-hand panel is the same, except that n has been increased to 300. 
The increased accuracy for larger n is clearly visible. 

We have also compared our results with the simulation results from the 
importance sampling algorithm of Blitzstein and Diaconis [12] for a variety 
of other examples. The results of Figure 1 are typical. This convinces us that 
the procedures developed in this paper are useful for practical problems. 

Comparison to the Barvinok and Hartigan work. As mentioned before, 
the present work is closely related to a recent series of papers by Barvinok 
and Hartigan [5-10]. The work was initiated by Barvinok who looked at 
directed and bipartite graphs in [6]. In their most recent article [10] (up- 
loaded to arXiv when our paper was near completion), they study uniform 
random (undirected) graphs on n vertices with a given degree sequence 
d = (di, . . . , dn) and work with an exponential model as in Section 1.2 with /3j 
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(a) n = 100, /5i~ Uiiif[-l,l]. (b) n = 300, /3i ~ Unif[ -1, 1]. 

Fig. 1. Simulation results: plot of Pt vs. Pi. 



chosen so that the expected degree at i under the /3-model is di. Let G^ be 
a uniformly chosen random graph with the degree sequence d and G/3 be 
a random graph chosen from the /3-model. One of their main results shows 
that (under hypothesis) these two graphs are close together in the following 
sense: Fix a set of edges S in the complete graph on n vertices. Let X^ 
be the number of edges of Gd in S. Let X^ be the number of edges of G^ 
in S. They prove that X^/'n? and X^jr? are each concentrated about their 
means (using results from the earlier work [5]) and that these means are ap- 
proximately equal. Their theorem is proved under a condition on the degree 
sequences that they call "delta tame." 

While the two sets of results (i.e., ours and those of Barvinok and Harti- 
gan) were proved independently and the methods of proof are quite different 
in certain parts (but similar in others), the possible connections are tantaliz- 
ing. We believe that their mode of convergence (Gd and G^ contain about 
the same number of edges in a given set) is equivalent to the graph limit con- 
vergence used here. Perhaps this can be established using the "cut-metric" 
of Frieze and Kannan, as expounded in [14]. We further conjecture, based 
on Lemma 4.1 in this paper, that their delta tame condition is equivalent 
to our condition that the limiting degree sequence / is in the interior of T . 
If this is so, then Proposition 1.2 (or more accurately. Lemma 4.1) gives 
a necessary and sufficient condition for a degree sequence to be delta tame, 
showing that essentially all degree sequences except the ones close to the 
Erdos-Gallai boundary are delta tame. 

In summary, the Barvinok and Hartigan work [10] contains elegant esti- 
mates of the number of graphs with a given degree sequence and extensions 
to bipartite graphs under a condition called delta tameness; we work in the 
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emerging language of graph limits and prove a limit theorem under a contin- 
uum version of the easily verifiable Erdos-Gallai criterion. Our work contains 
an efficient algorithm for computing the maximum hkelihood estimates of /3 
for a given degree sequence with proofs of convergence of the algorithm and 
consistency of the estimates. 

The rest of the paper is organized as follows. In Section 2 we prove The- 
orem 1.5. This is followed by the proof of Theorem 1.4 in Section 3. Both 
of these theorems are required for the proof of Theorem 1.3, which is given 
in Section 4. Proposition 1.2 is proved in Section 5. Finahy, the proof of 
Theorem 1.1, which uses all the other theorems, is given in Section 6. 

2. Proof of Theorem 1.5. For a matrix A = {aij)Kij<:n, the L°° operator 
norm is defined as 

K^ oo -^ max AxLq. 

|a:|oo<l 

It is a simple exercise to verify that 



n 



\A\co = max y 



l<i<n- 

Given (5 > 0, let us say the matrix A belongs to the class Cn{5) if |^|oo ^ 1 

and for each l<i^ j <n, 

s 

an > 5 and aij < . 

n— 1 

Lemma 2.1 is our key tool. 

Lemma 2.1. Let Cn{S) be defined as above. If A,B^Cn{5), then 

,, , 2(n-2)52 

\AB\^<1-^ ^. 

n — \ 

Proof. Fix l<i^k<n. By the definition of Cn{5), 



CLijbjk > —, 7T9- and aubik + ai^b^k < ^ • 

■' (n — ly n— 1 

Now, if a;,y are two positive real numbers, then |x — y| = |x|-|-|y| — 2min{a;,y}. 
Taking x = Ej^{i,fc} (^ij^jk and y = -{aubn, + aikbkk), we get 



n 

^ai^bjk 


n 


-2min<^ ^ aijbjk, 


-{aiibik + aikbkk 


i=i 


i=i 


^JtiiM 






n 

- y ]\o-ijbjk\ 


2(n - 2)6^ 
{n-lf ■ 
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Combining this with the hypothesis that \A\oo < 1 and |-B|oo ^ 1; we get 

n 

\AB\oo = max > 

l<j<n'^-' 



fc=l 
71 n 



2_^aijbjk 



^i^fiiEEl«*^-^^-fcl 



<1 



The proof is complete. D 



j=i k=i 
n— 1 



2(n - 2)6^ 
n-1 



Now recall the functions rjj defined in (4). Let 

Note that for each i and x, ^j-nQijix) = 1. Again, for each 
dxj ^ • -^ ■ ■ ^-^ 



e ^ 



and similarly for each distinct i and j, 

dxj 
Now, if |x|oo < K, then clearly 

Thus, 



e -J + & 



■%■ 



e J + e^ 



r^ii- 



^e"-^ < rij(x) < ie^ for ah 1 < i / j < n. 



-2ii' 



2_fs: 



n 



- < gii(x) = ^^--^ ^ _ 



< 



It follows that for every 1 < « 7^ i < n 

dipi 



r,2K 



-iK 



<-Ll< 



^ ^ n-l^ dxj ^ 2{n- 1) 

and also, for every 1 < i < n. 



(7) 



1 4^- < ^ < p2ir 

2 -9x,- ■ 



Now take any x, y G M" and let K be the maximum of the L°° norms of x, 
y, </?(x) and ip{y). Let J(x,y) be the matrix whose (z,j)th element is 

'1 



Ji,(x,y) = y" |^(fx+(l-t)y)dt. 
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It is a simple calculus exercise to verify that 

(8) 99(x)-v?(y)=J(x,y)(x-y). 

For each i^ j, dipi/dxj is negative everywhere and for each i, dipi/dxi is 
positive everywhere. Moreover, for each i, 



dipi 



dxj 
j=l 3 



dipi s-^ difi 



EU^i _ U^i ■^ Ui[Ji _ 

r)i" • Finn; ^—^ FIt ; 



dXi ^-^ dXj 



It follows that for i and any x,y, YTj=\ l'^ii(^'y)l = 1- In particular, |J(x, 
y)|oo = 1- From (6) and (7) and the fact that |J(x,y)|oo = 1, we see that 
J(x,y) ^ C,n{S) for (5= \e~'^^ . Similarly, 

v(</'(x)) - y'(v?(y)) = J((/7(x),v5(y))((/9(x) - v7(y)) 

= «^('/'(x),¥'(y)) J(x,y)(x - y) 

and J(v3(x),(/5(y)) G C.n{S) also. Applying Lemma 2.1, we get 

|J(^(x), v.(y))J(x,y)U < 1 - ^^''-'Y . 

n — v 

Thus, 

(9) |<^((p(x)) - (p((/.(y))|oo < (i - ^^^3Y^) 1^ - yloo- 

The quantity inside the brackets will henceforth be denoted by 0(x, y). Note 
that < ^(x, y) < 1 and Q is uniformly bounded away from 1 on subsets of 
M" X R". Moreover, since |J(x, y)|oo = 1, we also have the trivial but useful 
bound 

|c/?(x)-99(y)|oo < |x-y|oo. 

Now suppose ip has a fixed point /3. If we start with arbitrary xq and define 
x/j_|_i = (p{xk) for each k>0, then for each k, we have 

|xfc+i - f3\oo = l^i^k) -^pi(3)\oo < |xfc-^|oo. 

In particular, the sequence {xk}k>o remains bounded. Therefore, by (9), the- 
re is a single ^ € [0, 1), depending only on |/3|oo and |xo|oo in a continuous 
manner, such that for all k>0, we have 

(10) |xfc+3 — Xfc+2|oo < ^|xfc+l — Xfcloo 

and 

|xa.-+2-/9|oo <6'|xfc-^|oo. 



14 
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The second inequality shows that x^ converges to /3 geometricahy fast and 
the first inequahty gives 



|xo — /3|oo < 2_^ l-^'^ ~ Xfc+iloo 
fc=0 

1 



< 



1 



Xo-Xi|oo + |xi -X2I00) 



< 



1 



|xo-xi| 



Finally, note that if /3 does not exist, then the sequence {x^} must have 
a divergent subsequence. Otherwise, (9) would imply that (10) must hold 
for all k for some £ [0, 1). This, in turn, would imply that x^ must converge 
to a limit as k ^ 00, which would then be a fixed point of <p and, therefore, 
a solution of the ML equations. The proof is complete. 

Before moving to the next section we will prove a technical lemma which 
will be of use in the proof of Theorem 1.1 based on the above calculations. 

Lemma 2.2. Let x,y eW such i/iai max{|x|oo, |y|oo} < -f^'- Then 
|(^(x)-^(y)|i<2e2^|x-y|i, 
where | • |i is the usual L^ norm on M". 



Proof. By equation (8), 

\(p{x) - (f{y)\i = |J(x,y)(x-y)|i 



E 



5Z(^j- 



%■; 



|^(tx + (l-i)y)di 



dxi 



n / " 

<^\xj-yj\- [Y^ sup 



j=i 



n 



a=i*e[0'i] 



|i(*x + (l-%) 



2e2^|x-y|i, 



<^2e2^|x,-y,| 

where the second inequality follows from equations (6) and (7). D 

3. Proof of Theorem 1.4. We need the following simple technical lemma. 

Lemma 3.1. Suppose f :W^ -^M is a twice differentiable function such 
that M := sup^giRn /(x) < 00. Let V/ and V^/ denote the gradient vector 
and the Hessian matrix of f and suppose there is a finite constant C such 
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that the L^ operator norm of V^/ is uniformly hounded by C . Then for 

any x E M" , 

|V/(x)|2<2C(M-/(x)), 

where \ ■ \ denotes the Euclidean norm. In particular, there exists a sequence 
{xfc}fc>i such that linifc^^oo V/(xfc) = 0. 

Proof. Fix a point x G M" and let y = Vf{x.). Suppose C is a uniform 
bound on the L^ operator norm of V^/. Then for any i > 0, 

(11) |V/(x + ty)-V/(x)j<Ct|y|. 

Now let g{t) = /(x + ty). Then for all t, 

g{t)-g{0)<M-f{x). 
Again, note that 

5'(i) = (y,V/(x + ty)) 

= (y, V/(x + ty) - V/(x)) + (y, V/(x)) 
>-Ct|yp + |y|2. 

[The last step follows by (11) and Cauchy-Schwarz.] Thus, for any t > 0, 

M - /(x) > g{t) - g{0) = [ g'{s) ds > \y\^ / (1 - Cs) ds. 

Jo Jo 

Taking t = 1/C gives the desired result. D 

Proof of Theorem 1.4. Let 5^ = (51 , . . . , g'n) : M" -> M" be the function 
defined as 

; — , i = 1, . . . ,n. 

Then TZ is the range of g. This is because the expected degree of vertex i of 
a random graph following the law Px is gi{x). In particular, the vector ^(x) 
is a weighted average of degree sequences and hence, 

conv(i:') 5 n. 

Now, for every y G M", let /y : E" ^^ M be the function 

n 

/y(x) = ^x,y,-log Y. (l + e"^+"0- 

i=l l<i<jf<n 
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Note that under ¥^, the probabiHty of obtaining a given graph with degree 
sequence d = (di, . . . , d„) is exactly 



Thus, the above quantity must be bounded by 1 and hence, taking logs, we 
get /rf(x) < 0. Since /y(x) depends linearly on y, this implies that 

/y (x) < for ah y € conv(P), x € M". 

Now fix y G conv(2?). Then /y(x) < for all x G W\ Moreover, it is easy to 
show that V^/ is uniformly bounded. Hence, it follows from Lemma 11 that 
there exists a sequence {:x.k}k>i such that limfc_^oo V/y(xfc) = 0. But 

V/y(x)=y-<7(x). 

Thus, y = lim^^oo (7(xjt). This shows that 

conv(P) C ^ 

and hence, completes the proof of the claim that conv(D) =TZ. D 

4. Proof of Theorem 1.3 (Existence and consistency of the MLE). This 
section uses the notation of Section 1 without explicit reference. The proof 
consists of two lemmas. The first lemma gives a condition for the "tightness" 
of the MLE. This result is closely related to the Erdos-Gallai characteriza- 
tion of degree sequences. The second lemma shows that conditions needed 
for the first lemma are satisfied with high probability. An addenda at the 
end of the section contains some results about existence of the MLE and the 
closely related topic of conjugate Bayesian analysis. 

In this section we will repeatedly encounter statements like "c is a positive 
constant depending only on a, 6, . . . ." Such a statement should be interpreted 
as "c can be expressed as a function of a, 6, . . . that is bounded away from 
and oo on compact subsets of the domain of (a, 6, ...)." Sometimes, c will 
be expressed as C{a, b, . . .). 

Lemma 4.1. Let (di, . . . ,dn) be a point in the set TZ of Theorem 1.4- 
Suppose there exist ci,C2 S (0, 1) such that C2(n — 1) < dj < ci(n — 1) for 
all i. Suppose C3 is a positive constant such that 

\ inf |Vmin{d,-,|i?|} + |S|(|i?|-l)-VdA>C3. 

Then a solution /3 of (3) exists and satisfies l/3|oo < C4, where C4 is a constant 
that depends only on ci, 02,03. 
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Proof. In this proof, C(ci, 02,03) denotes positive constants that de- 
pend only on ci, C2, C3, in the sense defined above. The argument repeatedly 
uses the monotonicity of e^"'"^/(l + e^"*"^) in x for each y. 

Assume first that /3 exists in the sense that there exists /3 € M" such 
that (3) is satisfied. Let ci, 02,03 be as in the statement of the lemma. It is 
proved below that |/3|oo is bounded above by (7(01,02,03). 

Let (imax := maxjdj and dmm '■= minjtij. Similarly, let /Smax := maxj/3j 
and /?min :=niinj/3j. The first step is to prove that /?max < C (01,02,03). If 
/3max < 0, there is nothing to prove. So assume that /3max > 0. Let 

m:= \{i:^i > -^/3max}|- 

Clearly, by the assumption that /3jnax > 0, it is guaranteed that m > 1. Let i* 
be an index that maximizes /3j. Then by (3), we see that 

p(l/2)/3max 

dma^>di* > (m- 1)- 



1 + e(V2)/3max 

This implies 

n-m>n-l- (imax(l + e"^^/^^'^™"'') > n - 1 - oi(n - 1)(1 + e~(i/2)^"^="'). 

In particular, this shows that if /3max > C'(ci) then m <n and hence, there 
exists i such that /3j < — |/3max- Suppose this is true and fix any such i. (In 
particular note that /3j < 0.) Let 

m■■=\{j■■jJ^iJj<-^i3^}\■ 
Then by (3), 

e(V2)ft 

dmin <di< rrii + n - 1 - mj, 

1 + e(V2)ft 

which gives 

mi < (n - 1 - d^in)(l + e(i/2)ft) < (^ _ i)(i _ C2)(l + e-(i/4)/3max)_ 

Note that there are at least n — mi indices j such that /3j > —3 A > j/^max- 
The last display implies that if /3max > (7(01, 02), then there exists i such 
that n — mi>bn, where 

b:=cl 

Consequently, if /3max > (7(01, 02), there is a set ^4 C {1, ... , n} of size at least 
bn such that 13 j > j/3max for all j G ^4, where 6 = 02. Henceforth, assume 
that /3max is so large that such a set exists. Let 



/i:=V/3i 
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For each integer r between and j^h — 1, let 

Dr:= <t: \-rh<Pi< \-{r+l)h>. 



Since Do,Di, . . . are disjoint, there exists r such that 

Dr < 



(l/16)/i-l' 

provided h > 16. By assumption, /3max > C{ci,C2)- Since we are free to choose 
C(ci, C2) as large as we like, it can be assumed without loss of generality that 
h>16. 

Fix such an r between and j^h — I. Let 

B:={,:ft>%^-(r+i).}. 

Clearly, the set B contains the previously defined set A and hence, 
(12) \B\>bn. 

Now, for each i 7^ j, define 



Pij ■■-- 



For each i, let 

jeB\{t} 
Since /3j > ^-^^ for each i ^ B, it follows that 

\B\{\B\-l)-Y,df = \B\{\B\-l)- Yl P^i 

ieB i,j<^B,i^j 

(13) = Y. (1-^^j) 

i,j&B,iytj 

^ \B\i\B\-l) 

~ 1 + e(V8)/9max ' 

The above inequality is the first step of a two-step argument. For the second 
step, take any j ^ B. Consider three cases. First, suppose $j > — fiaai + (r + 

\B\ 



l)h. Then for each i ^ B, J3i+ fij>ij and, therefore. 



iK,|i?|}-df < \b\-Yp^, = Y^^ -p^^) ^ i^' 



mm-l J _ ^ ^^ _ ^ 

ieB ieB 
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Next, suppose /3j < -Ss^ + rh. Then for any i ^ -B, /3i + /3j < -|. Thus, 
min{dj ,\B\} - df < dj - df = Y^ pij < ne"^/^ . 

Finally, the third case covers all j ^ B that do not fall in either of the 
previous two cases. This is a subset of the set of all j comprising the set Dr- 
Combining the three cases gives 

(14) X:(„„n{d„ m - 4) < ^ + nh-"'^ + i^. 

But 

jfB iGBJfB i&B 

Thus, adding (13) and (14), 

^min{rf,-,|S|} + |B|(|S|-l)-J]di 
j4B ieB 

(15) 



1 + e(V8)/3max 1 + e'^/^ /i - 16 ■ 

The left-hand side of the above inequality is bounded below by c^n'^, by 
the definition of C3 in the statement of the theorem. The coefficient of n^ 
on the right-hand side tends to zero as /3max —^ 00. This shows that /3max ^ 
C(ci, C2, C3), where the bound is finite since C3 > 0. Next, note that for any i, 



di< 



onpPi+Pn 



]_ _|_ gft+/3max 

and, therefore, if i** is a vertex that minimizes /3j, then 



'^min ^ Cli** ^ 



^pPminTPmax 
1 -1- gPmin+Pniax 



Combined with the upper bound on ^max and the lower bound on dmirn this 
shows that /3ram > —C{ci, 02,03). 

To complete the proof of the lemma, it must be proved that f3 exists. Since 
(di, . . . , dn) G TZ, by Theorem 1.4 there is a sequence of points {xk}k>o in I^" 
that converge to {di, . . . ,dn) for which solutions to (3) exist. Let {Pk}k>o 
denote a sequence of solutions. The steps above prove that |/3fc|oo < C for 
all large enough k where C is some constant depending only on ci , C2 , C3 . 
Therefore, the sequence {(3j.}k>o must have a limit point. This limit point 
is clearly a solution to (3) for the original sequence di, . . . ,dn- □ 
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The next lemma shows that the degree sequence in a typical realization 
of om' random graph satisfies the conditions of Lemma 4.1. 

Lemma 4.2. Let G he drawn from the probability measure P^ and let 
di,...,dn be the degree sequence of G. Let L := maxi<j<„ |/3jj and let c€ 
(0,1) be any constant. Then there are constants G > and ci,C2 G (0,1) 
depending only on L and a constant C3 € (0,1) depending only on L and c 
such that if n> G, then with probability at least 1 — 2n~^, C2(n — I) <di < 
ci(n — 1) for all i and 

^ inf {ymm{d,,\B\} + \B\(\B\-l)-y"di\ 

-^ ' ' ^'' '- ^j^B ieB ^ 



>C3 

Proof. Let 



61ogn 



n 



d, 



-^T 



e/3.+/3i 






Note that for each i, di is a sum of independent indicator random variables 
and E((ij) = ndi. Therefore, by Hoeffding's inequality [26], 

P(|di-(ii|>x)<2e-^'/2n_ 
Thus, if we let E be the event 

< max \di — di\> yGnlogn > , 
then by a union bound, 

HE) < \. 
n^ 

Now, clearly, there are constants d^ < 1 and C2 > depending only on L 
such that C2(n — 1) < (ij < ^[{n — 1) for all i. Therefore, under E'^, if n is 
sufficiently large (depending on L), we get constants ci,C2 depending only 
on L such that C2(n — 1) < dj < ci(n — 1) for all i. 
Next, define 

gidu-. ■,dn,B) := ^mm{dj, \B\} + \B\{\B\ - 1) -^di. 

j^B ieB 

Note that 

n 

\g{di,...,dn,B) - g{di,...,dn,B)\ <y^Jdi - di\ <rimax|dj - di\. 
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Moreover, following the notation introduced in the proof of Lemma 4.1, we 
have 

g{di,...,dn,B) 

j^B ieB 

>\B\{\B\-1)-Y,df 

ieB 

= Y. {l-pij)>C4\B\{\B\-l), 

where C4 € (0, 1) is a constant depending only on L. Thus, under £"^, n> C 
and \B\ >cn we have 

g{di,...,dn,B) >c-ir? -r?!'^ yJ^Xogn, 

where C3 G (0, 1) is a constant depending only on L and c. The proof is 
complete. D 

Proof of Theorem 1.3. Let E be the event defined in the proof of 
Lemma 4.2. Let C, ci, C2 be as in Lemma 4.2. By lemmas, if E^ happens and 
n> C, then a solution /3 of (3) exists and satisfies \j3\oo < C{L), where C{L) 
generically denotes a constant that depends only on L. This proves the 
existence of the MLE. The uniqueness follows from Theorem 1.5. 

The proof of the error bound uses Theorem 1.5. Let xq = /3 and define 
{xfc}fc>i as in Theorem 1.5. A simple computation shows that the ith com- 
ponent of xq — xi is simply log{di/di). Under E^ and n> C, this is bounded 
by C {L)y^n^^ log n. The error bound now follows directly from Theorem 1.5. 

Finally, to remove the condition n > C, we simply increase C{L) in The- 
orem 1.3 so that 1 — C{L)n~'^ < for n <C. The proof of Theorem 1.3 is 
complete. D 

Addenda. (A) Practical remarks on the MLE. Theorem 1.3 shows that 
with high probability, under the P/j measure, for large n the MLE exists and 
is unique. In applications, a graph is given and Theorem 1.3 may be used to 
test the P^ model. The MLE may fail to exist because the maximum is taken 
on at /3j = iboo for one or more values of i. For example, with n = 2 vertices, 
an observed graph will either have zero edges or one edge. In the first case, 
the likelihood is 1/(1 + e^^"*"^^), maximized at /3i = /32 = — oo. In the second 
case the likelihood is e^i"'"^2/(l + e^^^^^), maximized at Pi= (32 = oo. Here, 
the MLE fails to exist with probability one. 

Similar considerations hold when the observed graph has any isolated 
vertices and for a star graph. We conjecture: Let G be a graph on n vertices. 
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The MLE for the /3-model exists if and only if the degree sequence hes in 
the interior of the convex polytope conv(P) defined in Theorem 1.4. 

In cases where the MLE does not exist, it is customary to add a small 
amount to each degree (see the discussion in [11]). This is often done in 
a convenient and principled way by using a Bayesian argument. 

(B) Conjugate prior analysis for the P -model. Background on conjugate 
priors for exponential families is in [20] and [24, 25]. The /3-model 

P;3(G) = Z(/3)-ie^-i*(^)'^% /3€M",Z(/3)= J| (l + e^'+^O 

l<i<j<n 

has sufficient statistic d = (di,. . . ,dn)- Here d takes values in V, the set 
of degree sequences for graphs on n vertices. Thus, P^ induces a natural 
exponential family on D with a base measure fi that does not depend on (3. 
Following notation in [20], write 

P^(d)=/x(d)e^-'^-'"(^) with m(/3)= log Z(/3)= ^ log(l + e^'+^^). 

l<i<j<n 

Following [20], for do in the interior of conv(X') and uq > 0, define the con- 
jugate prior of M" by 

Here Z(no,do) is the normalizing constant, shown to be positive and finite 
in [20]. By the theory in [20], Vm(/3) = E/3(d) and 

E^„o,do (Vm(/3)) = E^„^,,^ (E;3(d)) = do. 

This identity characterizes the prior vTng do- The posterior, given an observed 
degree sequence d(G), is 

7I"no+l,(d(G)+nodo)/{no+l) • 

Clearly, the mode of the posterior can be found by using the iteration of 
Theorem 1.5. The proof of Theorem 1.5 shows that the mode exists uniquely 
for any observed d(G). The posterior mean must be found using standard 
Markov chain Monte Carlo techniques. 

A natural way to obtain feasible prior mean parameters [i.e., values of do 
that lie within the interior of conv(P)] is to consider a model of random 
graphs that puts positive mass on every possible graph on n vertices and take 
its expected degree sequence. For example, the Erdos-Renyi graph G{n,p), 
for < p < 1, is one such model. Its expected degree sequence is (c, c, . . . , c) 
where c = {n — l)p. Thus, (c, c, . . . , c) is a feasible mean parameter for every 
c G (0, n — 1). Similarly, the expected degree sequence in any of the standard 
models of power law graphs is a feasible value of do that has power law 
behavior. 
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5. Proof of Proposition 1.2 (characterization of the interior). 
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Proof of Proposition 1.2. Let us begin by restating the Erdos- 
Gallai criterion from Section 1. Suppose di>d2>--->dn are nonnegative 
integers. The Erdos-Gahai criterion says that di,. . . ,dn can be the degree 
sequence of a simple graph on n vertices if and only if XlILi di is even and 
for each 1 < A: < n, 



^di<k{k-l)+ ^ mhi{di,k}. 



i=l 



i=k+l 



Now take any function / G D'[0, 1] and let 



Gf{x):= va:ui{f{y),x]dy + x 



f{y)dy. 



Clearly, Gf{x) is continuous as a function of x. If / € J-", the E~G crite- 
rion clearly shows that Gj must be a nonnegative function. We claim that 
this implies that if / belongs to the interior of J-", then Gf{x) must be 
strictly positive for every x E (0,1]. Otherwise, there exists x G (0,1] such 
that Gf{x) = 0. If we show that there exists a sequence fn^ f ^n the modi- 
fied L^ topology such that Gf^{x) < for each n, then we get a contradiction 
which proves the claim. This is quite easily done by producing /„ that is 
strictly bigger than / in [0, x) and equal to / elsewhere, all the while main- 
taining left-continuity. 

Similarly, it is clear that any / € J-" must take values in [0, 1]. If / attains 
or 1, then we can produce a sequence fn^^ f whose ranges are not contained 
in [0, 1] and, therefore, / cannot belong to the interior of J-". 

Thus, we have proved that if / belongs to the interior of T, then / must 
satisfy the two conditions of Proposition 1.2. Let us now prove the con- 
verse. Suppose / G D'[0, 1] such that < ci < /(x) < C2 < 1 for all x G [0, 1] 
and Gf{x) > for all x G (0, 1]. We have to show that any function that is 
sufficiently close to / in the modified L^ norm must belong to J-. 

To do that let us first prove that j ^T . Take any n. Let d^ = [n/(i/n)J , 
i = 2, . . . , n, and d\ = [ra/(0)J . Since / is nonincreasing, we have di >d2 > 
■ ■ ■ ^ t?n- Increase some of the d^^s by 1, if necessary, so that ^df is even 
(and monotonicity is maintained). With this construction, it is clear that 



d? 






d" 




^1 


-/(o) 


+ 


""n 


-/(I) 


n 






n 





1 



+ -E 



^_,(i 

n \n 



4 
< -. 

n 



Thus, if d" denotes the vector (c?", . . . , d"), then d" converges to the scaling 
limit /. We need to show that for all large enough n, d" is a valid degree 
sequence. 
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Since / is bounded and nonincreasing, 

lim / \f{x) — f{\nx]/n)\dx = 



and so uniformly in 1 <k <n, 

Er=fe+i min{rf?, k} + k{k - 1) - E ti d- 



Gf{k/n] 



<e(n) 



where e{n) ^- as ?i — > oo. Thus, there exists a sequence of integers {ko^n)}, 
where k{){n)/n — )• as n — )• oo, such that whenever k > kQ{n), we have 

n k 

Y^ mm{d1,k} + k{k-l)-Y^d'^>0. 

i=k+l i=l 

Again, there exists c'l < 1 and C2 > such that if n is sufficiently large, we 
have C2 < d^/n < c'l for all i. Suppose n is so large that /co(?i)/n < C2 and 
(1 — c'i)n — ko{n) > 0. Then, if /c < ko{n), we have 

n k 

Y, minK,A;} + fc(A;-l)-^C 

i=k+l i=l 

n k 

> 2, min{c2n, k} + k{k — 1) — 2. nc'i 

i=k+l i=l 

= (n — k)k + k{k — 1) — diuk 

= ((1 - c;)n - k)k + k{k - 1) > 0. 

Thus, for n so large, we have that for all 1 <k <n, 

n k 

Y min{<'. A;} + A:(fc - 1) - ^ d'^ > 0. 

i=k+l i=l 

By the Erdos-Gallai criterion, this shows that (rf", . . . , d^) is a valid degree 
sequence. 

Thus, we have shown that any / that satisfies the two conditions of Propo- 
sition 1.2 must belong to J-". Now we only have to show that if / satisfies 
the two criteria, then any h sufficiently close to / in the modified L^ norm 
must also satisfy them. 

Note that Gf is a continuous function that is positive in (0, 1]. Moreover, 
for ah < X < 1, 

\Gf{x)-Gf,{x)\<\\f-f'\\,, 

so if /n — )• / in the modified L^ norm, then Gf^ — )■ Gj in the supnorm. Thus, 
for any e > there exists 6 > such that whenever ||/i — /||i/ < (5, we have 
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Gh{x) > for all x € [e, 1]. We also have that ci — 5 < h[x) < C2 + 5 for all 
< X < 1 . Choosing (5, e > small as necessary, we can ensure that ci — 5> e 
and 1 — e — (5 — C2>0. Fix such e, 5 and h. Then, for x £ (0, e), we have 

Gh{x) > / m.m{ci — 6,x}dy + X — / {c2 + S)dy 
Jx Jo 

= {1 — x)x + X — {C2 + S)x 
= {1 — e — 6 — C2)x + x^ > 0. 

But we also have Gh{x) > for x G [e, 1] by the choice of 6. Thus, we have 
proved that there exists 6 > such that whenever ||/i — /||i' < S, we have 
Gh{x) > for all x € (0, 1]. Choosing 6 sufficiently small, we can ensure that 
the range of h does not contain or 1. The proof of Proposition 1.2 is 
complete. D 

Proposition 1.2 can be extended into a complete version of the Erdos- 
Gallai criterion for graph limits. Suppose that W{x,y) is a symmetric func- 
tion from [0,1]^ into [0,1]. In [18], Section 4, it is shown that the correct 
analog of the degree distribution for the graph limit W is the distribution 
of the random variable 

(16) X= f W{U,y)dy, 

Jo 

where U is a random variable distributed uniformly in [0, 1] . If a sequence 
of graphs converges to W then the distribution of the random variable di/n 
(where i is chosen uniformly from n vertices and di is the degree of i) con- 
verges to X in distribution. The following result characterizes limiting degree 
variates. 

Proposition 5.1. Let X be a random variable with values in [0,1]. Let 
D{x) = sup{y :P(X > y) > x}. Then X has the representation (16) if and 
only if for all x € (0, 1] 



D{y)dy < X + / in.m{D{y),x}dy. 

J X 



The proof is essentially as given above, approximating VK by a sequence of 
finite graphs and using the Erdos-Gallai criterion. We omit further details. 

6. Proof of Theorem 1.1 (convergence to graph limit). 

6.1. Preliminary lemmas. We need a couple of probabilistic results be- 
fore we can embark on the proof of Theorem 1.1. The first one is a simple 
application of the method of bounded differences for concentration inequal- 
ities. 
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Lemma 6.1. Let H be a finite simple graph of size < n. Let G be a ran- 
dom graph on n vertices with independent edges. Let t(H, G) be the homo- 
morphism density of H in G, defined in (1). Then for any e > 0, 

¥{\t{H,G) -m{H,G)\> e) <2e-^'''''% 
where C is a constant that depends only on H . 

Proof. The proof is a simple consequence of the bounded difference in- 
equahty [36]. Note that the quantity t{H, G) is a function of the edges of G, 
considered as independent Bernouhi random variables. When a particular 
edge is added or removed (i.e., the corresponding Bernoulli variable is set 
equal to 1 or 0), hom(ff, G) is altered by at most Cw^^^""^, where C is 
a constant that depends only on H . This is because when we fix an edge, we 
are fixing its two endpoints, which leaves us the freedom of choosing the re- 
maining |y(i7)| — 2 vertices arbitrarily when constructing a homomorphism. 

Thus, alteration of the status of an edge changes t{H^ G) by at most Gn~'^. 
The bounded difference inequality completes the proof. D 

The second preliminary result that we need is a kind of local limit theorem 
that we need to pass from the /3-model to graphs with given degree sequence. 

Let d = (di, . . . , dn) be a valid degree sequence on a graph of size n. Let 
G = {y, E) be a random graph on n vertices labeled 1, . . . , n so that edges i, 
j are connected with probability pij satisfying di = Ylj^^iP-ij ^'^d so that 
(^ < Pij < 1 — 5 for some fixed < 5 < 2 • Let Wij denote the indicator that 
(i,j) is an edge in G. We obtain a lower bound on the probability that G 
has degree sequence d. 

Lemma 6.2. For any e > and large enough n, the random graph G has 
degree sequence d with probability at least 2 exp(— log((5)n'^'^'"''^). 

We first prove the following claim about the existence of 0-1 contingency 
tables. An m x n 0-1 contingency table with integer row and column sums 
ri , . . . , r^ and ci , . . . , c^ is an m x n matrix whose entries are or 1 and whose 
ith row and jth column sum to r^ and Cj, respectively. Denote the conjugate 
sequences as r* = #{rj : rj > i} and c* = #{cj : Cj > i}. Let (r[j]), (c[j]) denote 
the order statistics of (r^) and (q), that is, permutations of the sequences 
such that r[i] > r^j > • • • > r[m] and C[i] > cpj > • • • > cj^] • 

A condition of Gale and Ryser [23, 49] says that there exists a 0-1 con- 
tingency table for row and column sums ri , . . . , r^ and ci , . . . , c„ if and only 
if Ei=i ^i = Ei=i Ci and 

k k 

(17) En^]^E<' ^<k<m, 
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k k 



(18) E^w^E^' i<fc<n. 



i=l 1=1 



Claim 6.3. Let < 5 < 2 o.nd let (pij) be an m x n matrix such that 
S ^Pij ^^ — ^- Suppose that {ri) and (cj) are integer sequences satisfying 
the following: 

Em v-^n 

i=l ^« — 2^1=1 ^i! 

• ki - YTj=iPij\ ^ i'^^"- for l<i< m; 

T/ien t/iere exists a 0-1 contingency table with row and column sums (rj) 
and {ci). 

Proof. We establish that the Gale-Ryser conditions hold. Without loss 
of generality we may assume that ri > r2 > • • ■ > r^. Then condition (17) is 
equivalent to 

k k k n n 

(19) E'^^ ^ E< = EE^{'=.>0 = J^min{fc,c,}. 

i=l 1=1 1=1 j=l j=l 

Now 

fc n 



E^^^EE^^i + i'^'^^ 



i=l i=l J = l 

and hence, 



n n ( m ^ \ 

> min{/c, Cj} > > min< k, > pjj — — 5 ?7i > 

i=i j=i I i=i J 

k n n ( k m ^ 

- E E^^i + E ™^^i ^ " E^*j' E ^^i ~ 7^' 



k n 

, _ m 
4 

i=l j=l j=l (^ i=l i=k+l 



k n n 



> 2, / Pij + / niin< (5A;, {m — k)6 — —6 m> 

i=ij=i j=i ^ J 

y Tj + n I min< dk, (m — fc)5 — -6 m> — -6 k\. 



k 



where we used the fact that 6 < pij < 1 — 6. Now 6k > j6 k and when 



28 



S. CHATTERJEE, P. DIACONIS AND A. SLY 



l<k<m{l-6+{5^), 

(m - k)5 - l6^m = m6{l - l6) - k6 

>m6{l-6 + l6^){l + l6)-k5 

>k6{l + l6)-k5 = l5^k 

and hence, n{inin{Sk, {m — k)5 — j5'^m} — jd'^k) > 0. To establish equa- 
tion (19) it then suffices to consider m(l — 6 + j6'^) < k <m. In this case, 

™ 1 1 

Cj < > Pij -\ — 6 m < {1 — 5)m -\ — 5 m< k 



i=l 



and so 






j=i 



i=l 



estabhshing (17). Condition (18) follows similarly and hence, there exists 
a 0-1 contingency table with the prescribed row and column sums. D 

We are now ready to prove Lemma 6.2. 



Proof. We split the n vertices into subsets A = l,2, . . . ,n — n'^ and B = 
n — n" + 1, . . . , re where a = ^ + e. For 1 < ^ < j < |^| , choose Wij according 
to Pij. Let Q denote the event that the following conditions hold: 



• For all ieA 
(20) 



<„(H-^)/2. 



Yl ^^i~ Y pi^ 

j€A\{i} jeA\{i} 

> That the total number of edges in the subgraph induced by A satisfies 

21) E(^'- E m^<Y^^<Y(d■^- E w,j)+\B\{\B\-l). 



i&A 



j(iA\{i} 



ieB 



ieA 



j€A\{i} 



Both conditions hold with high probability by simple applications of Hoeffd- 
ing's inequality [26]. For example, the first follows from Hoeffding's inequal- 
ity as 



E ( 

jeA\{i} 



Wij - Ewij] 



>^(i+.)/2^<2e- 



(l/2)n= 



and taking a union bound over i £ A. 
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We will show that given Q there is always a way to add edges between 
vertices in i? x F so that the graph has degree sequence d. First we choose 
any assignment of the edges {wij)ij^B in B x B so that the total number of 
edges equals 

\eB ieA ^ jeA\{i} 

which is an integer because the sum of the degrees is even and is between 
and ^\B\{\B\ - 1) by equation (21). 

It remains to assign edges between A and B so that the graph has degree 
sequence d. This is exactly equivalent to the question of finding a 0-1 con- 
tingency table with dimensions \A\ x \B\, row sums ri = di — X]jgA\{jj^«j 
for i G A and column sums Ci= di — X^jg^x /ji Wij for i G S. 

Condition (20) guarantees that rj = [1 + o(l)] J2jeBPij ^^"^ since \B\ = 
o[\A\)], we have that Cj = [1 + o{l)]'^-^^pij uniformly in n. Hence, by 
Claim 6.3 a 0-1 contingency table with row and column sums (r^) and (cj) 
exists. 

Hence, whenever the edges {wij)ij^A satisfy G there exists at least one 
way to assign the other edges so that the graph has degree sequence d. 
Since any configuration {wij)i^v,j€B has probability at least jl^H-^l and is 
independent of G, the probability that G has the degree sequence d is at 
least P{G)exp{—log{5)n^^'^) and the result follows since G holds with high 
probability. D 

An alternative approach in the above lower bound could be through the 
enumeration of the number of graphs of a particular degree sequence as car- 
ried out in [10]. In fact, this approach would give a better lower bound than 
the one we obtain. This was brought to our attention recently by Alexander 
Barvinok. 

6.2. Proof of Theorem 1.1. Let d", G„ and / be as in the statement 
of the theorem. By Proposition 1.2 we know that / has the following two 
properties: 

A. There are two constants ci > and C2 < 1 such that ci < f[x) < C2 for 
allxG [0,1]. 

B. For each < 6< 1, 



inf< / m.m{f{y),x}dy + x — / 
^>HA Jo 



fiy)dy}>0. 



(The infimum is positive because the term within the brackets is a positive 
continuous function of x.) Now fix n and for each B C {1, . . . ,n}, consider 
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£{B):=Y,^^{dl\B\} + \B\{\B\-l)-Y,d7. 

Under the assumption that d^ > d!^ > ■ ■ ■ > d"!^, we claim that for each 1 < 
k <n, £{B) is minimized over all subsets B of size k when B = {1, . . . , k}. 
To prove this, take any B of size k. Suppose there is a G -B and b ^ B such 
that b <a. Let B' = {B\{a}) U {b}. Then clearly, since d^ > d^, we have 

^ mm{d], k}>Y^ min{d;^ k} 
JiB jiB' 

and 

i&B ieB' 

Thus, £{B) > £(B'), which proves the claim. Now by the definition of conver- 
gence of degree sequences and the fact that / is bounded and nonincr easing, 



^-^ n n 

i=l 



k/n 







(22) 



< 



E 



n n 



f{y)dy 



i/n 



{i-l)/n 



f{y)dy 



< 



-E 

i=l 



^-f(- 

n \n 



+ / \fix)-f{\nx]/n)\dx-^0 




for 1 <k <n. Similarly 



y~] -min<^^,-l-/ mm{f{y),k/n}dy^O 

uniformly in l</i;<?iasn— >• oo. Hence, we have that for any b € (0, 1), 
1 



mm 



n^ BQ{l,...,n},\B\>bn 



£{B) 



mm< 

k>bn 



inf 

x>b 



" 1 - r^i k\ 

y -mini —,-> + 
=fc+i "^ '' 



k(k-\) 



j=k+ 
1 



n^ 



Z — i/ Ti n 



i=l 



n n 



min{/(y),x}dy + x 



f{y)dy 



as n — > oo. 



Thus, we can apply properties A and B of the function /, the definition 
of scaling limit of degree sequences and Lemma 4.1 to conclude that for 
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all large n, a solution /3" = (/3", . . . ,/?^) to (3) for d" exists and |/3"|oo is 
uniformly bounded. 

For each n, define a function (7„ : [0, 1] — >■ M as 

gn{x):=(3r ifi^<x<- 
n n 

and let gn{0) :=/?". Now fix two positive integers m,n and let 

N := mn. 
Define a vector xq = (xq,!, ■ • • , xq^n) G 1^^ as follows: 

XQ,i = Pk if m{k - 1) + 1 < i < 7nk. 

In other words, 

^0 = (Pl ) Pi ) • • • ) Pi ) P2 ' P2 ) • • • ' P2 ' • • • 1 Mn ' Pn ' • • • ) Pn ) ' 

where each /3^ is repeated m times. For i > 1 define x^ = (p{xi_i) as in 
Theorem 1.5 (with N in place of n). Equivalently, 

d^ /N 
(23) xt,i -xi_i^i = log df -\ogyi^i^i = log — - — — - , 

where 



yt 



-Sr 






Note that by definition of yo.i and xo,i, if m(/c — 1) + 1 < i < mk, 
yo,i - mdfc = (m - 1)- ^fl^r < m. 

Consequently, if m{k — 1) + 1 < i < mk, 

(24) \yo,i/N - dl/n\ < 1/n. 

Hence, by equation (2) [similarly to (22)] it follows that 

1 ^ 
-j;|yo,i/iV-df/iV|<ei(n) 

«=i 

uniformly in N where ei(n) — t- as n — t- oo. From (2), (23), (24) (and im- 
plicitly using the continuity of log, property A of the function / and the 
uniform boundedness of |/3"|oo), we see that 

|xo-xi|i <Ne2in) 
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uniformly in m where e2{n) — )■ as n — t- oo. Since l/S^loo is uniformly bounded 
in n by Theorem 1.5, it follows that for large enough n,m, 

(25) |x,-/3^U<i^e' 

for some K and < ^ < 1 independent of n and m. Hence, for some K', also 
independent of n,m, 

SUp|x£|oo <K'. 

Consequently, by Lemma 2.2 we have that 



(26) 



|xo-x,|i<|^(2e2^^')M|xo-xi|i. 



Combining equations (25) and (26) and using the fact that |x|i < A^|x|oo we 
have that 

|xo - /3^|i < |xo - x,|i + |x, - /3^|i < I ^(2e2^')M iVe2(n) + i^e^iV. 

Now taking i = i{n) to infinity slowly enough so that 



'^(2e^^')42(n)^0 



it follows that 

|xo-/3^|i<iVe3(ri) 
uniformly in m where ei(n) — t- as n ^ oo. But 

|xo-/3^|i = N\\gn- gN\\i, 

where || ■ ||i is the usual L^ norm on functions on [0, 1]. Thus, 

\\9n - 5m||i < Ibn " 9n\\oo + \\9m - 9n\\oo < ^sin) + e^{m). 

This shows that the sequence {9n\ is Cauchy under the L^ norm and thus 
there exists a uniformly bounded function g* such that \\gn — 9* ||i — )• 0. Now, 
for each n define a function /„ as 

^"^^^■"X l + eS-W+Snto)"^^- 

Now by the uniform boundedness of the |(7n|oo, 
-I 







1 Q9*{x)+9*{y) 



dx^O 
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as n — 7- oo. But from the relation between /3" and d", it is easy to see that 
for X G (0, 1] that fn{x) = '^^x] /™ ~'~ 0{l/n) and hence, 

lim||/-/J|i^O. 

n 

It follows that 

(27) f{x)= f W*{x,y)dy a.e., 

JO 



where 



W*{x,y) 



gg*(x)+g*(y) 



1 _|_ e5*(a;)+g*fe) ' 

We now adjust g* on a set of measure so that equation (27) holds for all 
X. Set V:K^(0,1) as 

/■I gz+g*(y) 

By construction and since g* is uniformly bounded, it follows that iIj{z) is 
continuous, strictly increasing and bijective. By equation (27) we have that 

f{x) = i^{g*{x)) a.e. 

and hence, if we set 

gix) = ^p-\f{x)), 

then g{x) = g*{x) almost everywhere. Then for all x G [0, 1], 

f{x)= / W{x,y)dy, 
Jo 



^a{x)+9(y) 



where 

W{x,y)- ^ ^ gg(3,)+g(y) • 

Moreover, by the properties of "0 and /, we have that g € D'[0, 1] and its 
points of discontinuity are the same as /. 

Let us now prove that g is the only function in Z)'[0, 1] with the above 
relationship with /. Suppose h is another such function. Fix any n. Define 
a vector xq = (a;o,i, • • • , 3;o,n) S IR" as 

Xo^i:=h{i/n), i = l,...,n. 

For each 1 <i <n, define 

^* '~ Z^ I _|_ QXo,i+Xoj • 
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Then since heD'[0,l], 



(28) sup \yi/n - f{i/n) \ = sup 



^h{i/n)+h{y) 



<e4(n), 



where e^(n) -^^ as n — )> oo. Define xi in terms of xq and d" as in Theo- 
rem 1.5. Then for each z, 

(P- In 
x\,i - XQi = logd" - logyj = log -'^-j-. 

From (2), (28) and the above identity (and imphcitly using the property A 
of /), we see that 

|xi -xoloo <e5{n), 

where e5(n) — ?• as n — )• oo. Thus, by Theorem 1.5 we get 

|xo - fy\oo <eQ{n), 

where EQ{n) — )■ as n — > oo. This implies that \\h — gn\\i -^ and hence, 
that h = g a.e. Since we assumed both h and g are in -D'[0, 1] this implies 
that g = h on (0, 1]. To show that ^(0) = /i(0), observe that since g = h on 
(0,1], 

1 ^h(0)+hiy) ,-1 ^h{0)+g{y) 



and therefore, by the injectivity of -0, 5(0) = /i(0). 

Now fix a finite simple graph H. Let (3'^ be as above. Let G'^ denote 
a random graph from the /3"-model. Let d^ be the degree sequence of G' . 
Then it is easy to see that conditional on the event {d^ = d„} the law of G'^ 
is the same as that of G„. 

By Lemma 6.1, given any e > 0, we have that 

n\t{H,G'^)-m{H,G'^)\ >e)< e-^^"', 

where Ci is a constant that depends only on H and e. By Lemma 6.2, we 
know that 

P(d; = dO>e-^^"''', 

where C2 is another constant that depends only on |/3|oo- Thus, 

F{\t{H,Gn)-mH,G'J\>e) = F{\t{H,G'J-Et{H,G'J\>eK = dn) 

\tiH,G'J-Et{H,G'^)\>e) 



<e" 



-Czr? 



where C3 is a constant depending on ff, e and |/3|oo- Since gn -^ g, it is easy 
to prove that G^ converges to W almost surely. From the above inequality, 
it follows that G^ and G„ must have the same limit almost surely. The proof 
of the theorem is complete. 



RANDOM GRAPHS WITH A GIVEN DEGREE SEQUENCE 35 

Acknowledgments. The authors are indebted to Joe BHtzstein for many 
helpful tips and pointers to the literature and Martin Wainwright for the ref- 
erences to [4, 16, 52]. We particularly thank Alexander Barvinok for calling 
our attention to [10] and suggesting possible connections to our work and 
Svante Janson for a very careful reading of the manuscript and pointing out 
numerous small errors. Last, we thank the Associate Editor for a number of 
useful comments. 



[1 

[2; 
[3; 
[4; 
[s; 
[6; 

[s; 

[9 

[lo; 

[11 
[12; 

[13; 

[14 

[is: 



REFERENCES 

Aldous, D. J. (1981). Representations for partially exchangeable arrays of random 
variables. J. Multivariate Anal. 11 581-598. MR0637937 

Austin, T. (2008). On exchangeable random variables and the statistics of large 
graphs and hypergraphs. Probab. Surv. 5 80-145. MR2426176 

Austin, T. and Tao, T. (2010). Testability and repair of hereditary hypergraph 
properties. Random Structures Algorithms 36 373-463. MR2666763 

Barndorff-Nielsen, O. (1978). Information and Exponential Families in Statistical 
Theory. Wiley, Chichester. MR0489333 

Barvinok, A. (2010). What does a random contingency table look like? Combin. 
Probab. Comput. 19 517-539. MR2647491 

Barvinok, A. (2010). On the number of matrices and a random matrix with 
prescribed row and column sums and 0-1 entries. Adv. Math. 224 316-339. 
MR2600999 

Barvinok, A. and Hartigan, J. A. (2009). An asymptotic formula for the number 
of nonnegative integer matrices with prescibed row and column sums. Preprint. 
Available at http://arxiv.org/abs/0910.2477. 

Barvinok, A. and Hartigan, J. A. (2009). Maximum entropy Edgeworth estimates 
of volumes of polytopes. Preprint. Available at http;//arxiv.org/abs/0910.2497. 

Barvinok, A. and Hartigan, J. A. (2010). Maximum entropy Gaussian approx- 
imations for the number of integer points and volumes of polytopes. Adv. in 
Appl. Math. 45 252-289. MR2646125 

Barvinok, A. and Hartigan, J. A. (2010). The number of graphs and a random 
graph with a given degree sequence. Preprint. Available at http://arxiv.org/abs/ 
1003.0356. 

Bishop, Y. M. M., Fienberg, S. E. and Holland, P. W. (1975). Discrete Multi- 
variate Analysis: Theory and Practice. MIT Press, Cambridge, MA. 

Blitzstein, J. and Diaconis, P. (2009). A sequential importance sam- 
pling algorithm for generating random graphs with prescribed de- 
grees. Preprint. Available at http://www.people.fas.harvard.edu/~blitz/ 
BlitzsteinDiaconisGraphAlgorithm.pdf. 

BORGS, C, Chayes, J., LovASZ, L., Sos, V. T. and Vesztergombi, K. (2006). 
Counting graph homomorphisms. In Topics in Discrete Mathematics. Algorithms 
Combin. 26 315-371. Springer, Berlin. MR2249277 

BoRGS, C, Chayes, J. T., LovAsz, L., Sos, V. T. and Vesztergombi, K. (2008). 
Convergent sequences of dense graphs. I. Subgraph frequencies, metric properties 
and testing. Adv. Math. 219 1801-1851. MR2455626 

BoRGS, C, Chayes, J., LovAsz, L., Sos, V. T. and Vesztergombi, K. (2007). 
Convergent sequences of dense graphs II. Multiway cuts and statistical physics. 
Preprint. Available at http://research.microsoft.com/en-us/um/people/borgs/ 
papers/conright.pdf. 



36 S. CHATTERJEE, P. DIACONIS AND A. SLY 

[16] Brown, L. D. (1986). Fundamentals of Statistical Exponential Families with Applica- 
tions in Statistical Decision Theory. Institute of Mathematical Statistics Lecture 
Notes— Monograph Series 9. IMS, Hayward, CA. MR0882001 

[17] DiACONiS, p. and Freedman, D. (1984). Partial exchangeability and sufficiency. In 
Statistics: Applications and New Directions (Calcutta, 1981) 205-236. Indian 
Statist. Inst., Calcutta. MR0786142 

[18] DiACONiS, P., Holmes, S. and Janson, S. (2008). Threshold graph limits and ran- 
dom threshold graphs. Internet Math. 5 267-320 (2009). MR2573956 

[19] DiACONiS, p. and Janson, S. (2008). Graph limits and exchangeable random graphs. 
Rend. Mat. Appl. (7) 28 33-61. MR2463439 

[20] DiACONiS, p. and Ylvisaker, D. (1979). Conjugate priors for exponential families. 
Ann. Statist. 7 269-281. MR0520238 

[21] Erdos, p. and Renyi, A. (1960). On the evolution of random graphs. Publ. Math. 
Inst. Hungar. Acad. Sci. 5 17-61. MR0125031 

[22] Erdos, P. and Gallai, T. (1960). Graphen mit punkten vorgeschriebenen grades. 
Mat. Lapok 11 264-274. 

[23] Gale, D. (1957). A theorem on flows in networks. Pacific J. Math. 7 1073-1082. 
MR0091855 

[24] Gutierrez-Pena, E. and Smith, A. F. M. (1995). Conjugate parameterizations for 
natural exponential families. J. Amer. Statist. Assoc. 90 1347-1356. MR1379477 

[25] Gutierrez-Pena, E. and Smith, A. F. M. (1996). Erratum: "Conjugate parame- 
terizations for natural exponential families." J. Amer. Statist. Assoc. 91 1757. 
MR1439120 

[26] HOEFFDING, W. (1963). Probability inequalities for sums of bounded random vari- 
ables. J. Amer. Statist. Assoc. 58 13-30. MR0144363 

[27] Holland, P. W. and Leinhardt, S. (1981). An exponential family of probability 
distributions for directed graphs. J. Amer. Statist. Assoc. 76 33-65. MR0608176 

[28] Hoover, D. N. (1982). Row-column exchangeability and a generalized model for 
probability. In Exchangeability in Probability and Statistics (Rome, 1981) 281- 
291. North-Holland, Amsterdam. MR0675982 

[29] Hunter, D. R. (2004). MM algorithms for generalized Bradley-Terry models. Ann. 
Statist. 32 384-406. MR2051012 

[30] Jackson, M. O. (2008). Social and Economic Networks. Princeton Univ. Press, 
Princeton, NJ. MR2435744 

[31] KOLACZYK, E. D. (2009). Statistical Analysis of Network Data: Methods and Models. 
Springer, New York. 

[32] Lauritzen, S. L. (1988). Extremal Families and Systems of Sufficient Statistics. 
Lecture Notes in Statistics 49. Springer, New York. MR0971253 

[33] Li, L., Alderson, D., Doyle, J. C. and Willinger, W. (2005). Towards a theory 
of scale-free graphs: Definition, properties, and implications. Internet Math. 2 
431-523. MR2241756 

[34] LovASZ, L. and Szegedy, B. (2006). Limits of dense graph sequences. J. Combm. 
Theory Ser. B 96 933-957. MR2274085 

[35] Mahadev, N. V. R. and Peled, U. N. (1995). Threshold Graphs and Related Topics. 
Annals of Discrete Mathematics 56. North-Holland, Amsterdam. MR1417258 

[36] McDiarmid, C. (1989). On the method of bounded differences. In Surveys in Com- 
binatorics, 1989 (Norwich, 1989) (J. Siemons, ed.). London Mathematical So- 
ciety Lecture Note Series 141 148-188. Cambridge Univ. Press, Cambridge. 
MR1036755 



RANDOM GRAPHS WITH A GIVEN DEGREE SEQUENCE 37 

[37] McKay, B. D. (2010). Subgraphs of dense random graphs with specified degrees. 
Preprint. Available at http://arxiv.org/abs/1002.3018. 

[38] McKay, B. D. and Wormald, N. C. (1990). Asymptotic enumeration by degree 
sequence of graphs of high degree. European J. Combm. 11 565-580. MR1078713 

[39] MOLLOY, M. and Reed, B. (1995). A critical point for random graphs with a given 
degree sequence. Random Structures Algorithms 6 161-179. MR1370952 

[40] MOLLOY, M. and Reed, B. (1998). The size of the giant component of a ran- 
dom graph with a given degree sequence. Combin. Probab. Comput. 7 295-305. 
MR1664335 

[41] Newman, M. E. J. (2003). The structure and function of complex networks. SIAM 
Rev. 45 167-256 (electronic). MR2010377 

[42] Newman, M. E. J., Barabasi, A.-L. and Watts, D. J. (eds.) (2006). The Structure 
and Dynamics of Networks. Princeton Studies m Complexity. Princeton Univ. 
Press, Princeton, NJ. MR2352222 

[43] Park, J. and Newman, M. E. J. (2004). Statistical mechanics of networks. Phys. 
Rev. E (3) 70 066117, 13. MR2133807 

[44] PORTNOY, S. (1984). Asymptotic behavior of M-estimators of p regression parame- 
ters when p^/n is large. I. Consistency. Ann. Statist. 12 1298-1309. MR0760690 

[45] PORTNOY, S. (1985). Asymptotic behavior of M estimators of p regression parame- 
ters when p^ /n is large. II. Normal approximation. Ann. Statist. 13 1403-1417. 
MR0811499 

[46] PORTNOY, S. (1988). Asymptotic behavior of likehhood methods for exponential 
families when the number of parameters tends to infinity. Ann. Statist. 16 356- 
366. MR0924876 

[47] PORTNOY, S. (1991). Correction: "Asymptotic behavior of M estimators ofp regres- 
sion parameters when p^/n is large. II. Normal approximation." Ann. Statist. 
19 2282. MR1135181 

[48] Robins, G., Snijders, T., Wang, P., Handcock, M. and Pattison, P. (2007). Re- 
cent developments in exponential random graph (p*) models for social networks. 
Social Networks 29 192-215. 

[49] Ryser, H. J. (1957). Combinatorial properties of matrices of zeros and ones. Canad. 
J. Math. 9 371-377. MR0087622 

[50] Simons, G. and Yao, Y.-C. (1999). Asymptotics when the number of parame- 
ters tends to infinity in the Bradley-Terry model for paired comparisons. Ann. 
Statist. 27 1041-1060. MR1724040 

[51] TSOURAKAKIS, C. (2008). Fast counting of triangles in large real networks: Algo- 
rithms and laws. In Proc. of ICDM 2008 608-617. IEEE Computer Society, Los 
Alamitos, CA. 

[52] Wainwright, M. J. and Jordan, M. I. (2008). Graphical models, exponential fam- 
ilies and variational inference. Foundations and Trends in Machine Learning 1 
1-305. 

[53] WiLLiNGER, W., Alderson, D. and Doyle, J. C. (2009). Mathematics and the 
Internet: A source of enormous confusion and great potential. Notices Amer. 
Math. Sac. 56 586-599. MR2509062 

[54] Wormald, N. C. (1999). Models of random regular graphs. In Surveys in Combi- 
natorics, 1999 (Canterbury). London Mathematical Society Lecture Note Series 
267 239-298. Cambridge Univ. Press, Cambridge. MR1725006 



38 



S. CHATTERJEE, P. DIACONIS AND A. SLY 



S. Chatterjee 

CouRANT Institute of Mathematical Sciences 

New York University 

251 Mercer Street 

New York, New York 10012 

USA 

E-MAIL: sourav@cims.nyu.cdu 

A. Sly 

Theory Group 

Microsoft Research, One Microsoft Way 

Redmond, Washington 98052 

USA 

E-MAIL: allansly@microsoft.com 



P. Diaconis 

Department of Mathematics 

Stanford University 

Stanford, California 94305 

USA 

E-MAIL: diaconis@math. stanford.edu 



